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(54) Digital video decoder for decoding digital high definition and/or digital standard definition 
television signals 



(57) Methods and apparatus for reducing the com- 
plexity of decoder circuitry and video decoder memory 
requirements are disclosed. The discribed video decod- 
ers are capable of decoding HDW and/or SDTV pic- 
tures. The described video decoder may be used as part 
of a picture-in-picture decoder circuit for providing ple- 
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ture-in-picture capability wrthout providing multiple full 
resolution video decoders. The reduction in decoder cir- 
cuit complexity is achieved through the use of a plurality 
of data reduction techniques including the use of a pre- 
parser. dcwnsanpling, arid truncating pixel values. 
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Description 

Field of the Invention 

The present invention is directed to video decoders 
and, more particularly, to methods and apparatus for 
implementing video decoders that are capable of decod- 
ing high definition television fHOTV**) signals and/or 
standard definition television ("SDTV") signals. 

p^C^ ground of the Invention 

The use of digital, as opposed to analog signals, for 
television broadcasts and the transmission of other types 
of video and audio signals has been proposed as a way 
of allowing irrproved picture quality and more efficient 
use of spectral bandwidth over that currently possible 
using analog NTSC television signals. 

The International Standards Organization has set a 
standard for video data compression for generating a 
conpressed digital data stream that is expected to be 
used for digital television. This standard is refenred to as 
the ISO MPEG (International Standards Organization - 
Moving Picture Exerts Group) CMPEG") standard. In 
accordance with the MPEG standard, video data is 
encoded using discrete cosine transform encoding and 
is arranged into variable length encoded data packets for 
transmission. 

One version of the MPEG standard. MPEG-2. is 
described in the international Standards Organization - 
Moving Picture Experts Group. Drafts of Recommenda- 
tion H.262. ISO/iEC 13818-1 and 13818-2 titled "Infor- 
mation Tecfinology - Generic Coding Of Moving Pictures 
and Associated Audio" (hereinafter "the November 1993 
ISO-MPEG Committee draft") hereby expressly incorpo- 
rated by reference. Any references made in this patent 
application to MPEG video data is to be understood to 
refer to video data that complies with MPEG-2 standards 
as d^ined in the November 1 993 ISO-MPEG Committee 
drafts. 

MPEG video data may be used to support either 
high definition television ("HDTV"), wherein the video 
frames are of higher resolution than those used in 
present NTSC signals, or what will be refen-ed to as 
standard definition television ("SDTV"). e.g.. television 
whi<^ has approximately the same resolution per frame 
as the existing analog NTSC standard. Because HDTV, 
which includes the proposed United States Advanced 
Television Standard ("US ATV^. provides higher resolu- 
tion than SDTV. more data is required to represent a 
HDTV frame than is required to represent a SDTV frame. 
Accordingly, it is possible to transmit nrujltiple SDTV sig- 
nals in the same bandwidth required to support a single 
HDTV signal. 

MPEG Main Prcrfile at Main Level ("MP@ML-) spec- 
ifies various requirements for an MPEG conr^iant stand- 
ard definition television signal and associated decoding 
equipment. MP@ML allows pictures as large as 
720x576 pels for a total of 41 4.720 pels per picture. The 



proposed standard for US ATV allows for pictures as 
large as 1920 X 1080 pels for a total of 2.073.600 pels 
per picture. 

Because of the relatively large anrrount of data 
5 required to represent each frame of a HDTV picture, 
HDTV decoders must support much higher data rates 
than SDTV decoders. The additional memory required 
by a HDTV decoder, as compared to a standard SDTV 
decoder, and the increased complexity associated with 
10 the inverse DCT circuit and other components of a HDTV 
decoder can make a HDTV decoder considerably more 
expensive than an SDTV decoder. 

In fact, the cost of memory alone may make a HDTV 
set incorporating a HDTV decoder prohibitively expen- 
75 si ve for some consumers. It is expected that a fully MPEG 
compliant video decoder for HDTV will require a mini- 
mum of 1 0 MB of RAM for frame storage with a practical 
HDTV decoder probably requiring about 16 MB of rela- 
tively expensive Syrxjhronous DRAM. 
20 Accordingly, there is a need for a method and appa- 
ratus that permits: (1) a simplification of the complexity 
of the circuitry required to implement a HDTV decoder, 
(2) a reduction in the anrxxint of memory required to 
inclement a HDTV decoder circiit. and (3) a single 
25 decoder that is capable of decoding both SDTV and 
HDTV signals. Furthernrwe. it is desirable that the cost 
of such a decoder be low erxxigh that it is in a range that 
would be acceptable to nxjst consumers, e.g., approxi- 
mately the cost of a SDTV decoder. 
30 While various proposals call for transmitting only US 
ATV signals it has also been suggested that some digital 
SDTV signals be broadcast Various combinations of 
broadcasting HDTV and SDTV signals are possftDle with 
multiple SDTV shows being broadcast during a particu- 
35 lar time of the day and a single HDTV broadcast being 
transmitted in the same bandwidth used for tfie SDTV 
signals at a different time of the day 

Thus, in order to remain compattole with both HDTV 
and SDTV broadcasts, there would be value in a televi- 
40 sion receiver include a video decoder capable of decod- 
ing txrth HDTV and SDTV signals and furthwiiore. that 
such a video decoder be capable of being implemented 
at a relatively low cost 

In addition, there is a need for a method and appa- 
45 ratus for implementing picture-in-picture capability in a 
digital television without incurring the cost of multple f Jl 
resolution decoders. In known analog picture-in-picture 
systems, during picture-in-picture operation, and full res- 
olution decoder is normally used to decode the TV signal 
so used to produce a main picture and a second full reso- 
lution decoder is used to decode the television signal 
which is used to provide the second picture displayed 
within a small area of the main picture. 

55 Summary of the Prese nt Invention 

The present invention is directed to video decoders 
and. more particularly, to methods and apparatus for 
implementing video decoders that are capable of decod- 
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ing high definition television ("HOTV") signals and/or 
standard definition compressed video signals. 

In particular the present invention is directed to a plu- 
rality of methods and apparatus for reducing the com- 
plexity of digital video decoder circuitry and for reducing 
video decoder memory requirements. 

A HDTV decoder capable of decoding HDTV signals 
at less than full HDTV resolution, e.g.. at approximately 
SDTV resolution and/or decoding SDJV signals is 
desaibed. It is expected that as a result of the various 
data reduction features and circuit simplifications of the 
present invention a combined HDTV/SDTV decoder that 
outputs pictures at approximately SDTV resolution can 
be txjilt at approximately 130% of the cost of a digital 
SDTV decoder. 

In accordance with one embodiment of the present 
invention the described reduced resolution HDTV video 
decoders are used as part of a picture-in-picture decoder 
circuit for providing picture-in-picture capability without 
the use of multiple full resolution video decoders. 

The decoder of the present invention achieves sub- 
stantial reductions in circuit conplexity and memory 
requirements through the use of a plurality of data reduc- 
tion techniques including the use of a preparser. down- 
sanpling. and truncating bytes of data representing pixel 
values so tfiat the pixel values are represented using only 
5. 6 Of 7 bits as opposed to. e.g.. 8 bits per DCT value. 

In accordance with one embodiment off the present 
invention, the preparser is used to discard data to permit 
a smaller channel buffer than would otherwise be 
rec^ired to be used. In addition, the preparser discards 
run-length/amplitude symbols for each rnacroblock when 
the nunrtoer of runrlength/amplitude s^bols exceeds a 
maximum preselected nurrtoer per macroWock In one 
entKxJiment the preparser also limits the numbe^pf bits 
output per unit time period, e.g.. by limiting the number 
of bits per nriacroWock. such that a predetermined data 
processing rate >Attich is determined .by the combined 
capability of the coded data buffer and syntax parser and 
variable length decoder circuit is not ^xceeded. In this 
rT»anner, the present invention permits a sirrpler real- 
time syntax parser and variable length decoder drcurt to 
be used than would be possible without such preparsing. 

In accordance with the present invention, an inverse 
quantizer circuit and an inverse discrete cosine trans- 
form circuit are designed to process only a preselected 
set of OCT coefficients con-esponding to each macrob- 
lock arid to set the rest of the coefficients, e.g.. the DCT 
coefficients removed by the preparser, to zero. In this 
manner the computational requiremients of the inverse 
quantizer circuit and inverse discrete cosine transform 
circuit are substantially reduced allowing for simpler cir- 
cuit implementation. 

In addition to the use of the preparser. the use of a 
downsanrpler and/or a least significant bit truncation cir- 
cuit provide for further reductions in the amount of data 
used to represent each video picture thereby resulting in 
a significant reduction in frame buffer memory require- 
ments. 



In order to compensate for the effect of data reduc- 
tion, including downsampling and data elimination, the 
decoder of the present invention may include low pass 
filters. 

The atx)ve described features and errtoodiments of 
the present invention along with numerous other features 
and entKxliments are described below. 

Brief Description of the Drawings 
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Figure 1 illustrates a video decoder in accordance 
with one exemplary embodiment of the present inven- 
tion. 

Figure 2A illustrates a video decoder in accordance 
15 with a second exemplary embodiment of the present 
invention. 

Figure 2B illustrates a video decoder circuit capable 
of decoding both HDTV and SDTV video signals in 
accordance with the present invention. 
20 figures 3A through 3E illustrate 8x8 blocks of DCT 
coefficients corresponding to a HDTV rnacroblock in 
accordance with different embodiments of the present 
invention. 

Rgure 4 illustrates an exemplary circuit for decoding 
25 video images to support picture-in-picture television 
operation in accordance with the present invention. 

Detailed Description 

30 The present invention is directed to video decoders 
and, more particularly, to a low cosX video decoder that 
is capable of decoding high definition and/or starxlard 
definition digital video signals. In accordance with the 
present invention decoded HD pictures will be displayed 

35 with less than fiili HD resdutionV e.g.. at approximately 
SD resolution, while SD pictures will be displayed at full 
SD resolution. The decoder of the present invention can 
be implemented as a iow cost HDTV decoder or as a 
joint HD/SD television decoder. While the detailed 

40 description of the preserrt invention is directed primarily 
to exenplary embodiments off HDTV. SDTV and com- 
bined HDTV/SDTV decoders, it is to be understood that 
the numerous features of the preserrt invention may be 
used to provide low cost reduced resolution digital video 

45 decoders suitable for decoding digital pictures in general , 
and that this disclosure is therefore not limited solely to 
SDTV and HDTV decoders. Furthermore, while the 
exenplary embodiments are described as MPEG com- 
patible decoders it shouW be noted that the decoders of 

50 the present inventkwi can be irnplemented to decode a 
plurality off MPEG like video data streams that use dis- 
aete cosine transform data encoding. 

In the case of television sets with relatively small 
screens. e.g.. less than 20 iiwh screer^. the difference 

55 in perceived resolution and pk:ture quality t)etween a 
HDTV signal and an SDTV signa) will be barely noticea- 
ble. Accordingly, low cost HDTV decoders in accordance 
with the present invention which provide less than full 
HDTV picture resolution and quality are particularly well 
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suited for such applications because of their cost advan- 
tages. The decoder of the present invention is also well 
suited for providing picture-in -picture capability because 
of the small display area of the second picture being dis- 
played and the relatively low cost of the decoder of the 
present invention as compared to the cost of a full HDTV 
decoder. 

The method of the present invention for decoding 
HD and SD pictures includes the steps of reducing the 
resolution of received HD pictures prior to decoding by 
using, e.g.. a preparser unit artd/or adaptive field/frame 
downsampling to reduce the corrplexity of later process- 
ing stages of the decoder. In addition, by using an inverse 
discrete cosine transform ("IDCT") circuit designed to 
maximize the amount of common circuitry used for 
processing both reduced resolution HD and SD pictures 
the cost of implementing a combined HD/SD decoder cir- 
cuit is reduced. 

Referring now to Figure 2, there is illustrated a video 
decoder generally indicated by the reference numeral 
100. irrplemented in accordance with one embodiment 
of the present invention. The illustrated decoder 100 is 
capable of decoding HD and/or SD television signals, 
e.g.. MPEG compliant television signals. When used as 
a HD decoder, the decoder circuit 100 can produce 
images from a HDTV signal that are comparable in qual- 
ity or slightly better than SD television images. 

The SD/HD video decoder circuit 100 comprises a 
preparser 112, a syntax parser and variable length 
decoder droit 120, an inverse quantizer 122. an inverse 
DCT circuit 124. a down sampler 126. a summer 128, a 
switch 129, a post processor 1 36. a motion compensated 
prediction ("MCP*) circuit 130. and a memory circuit 114 
including a coded data buffer and a frame buffer 118. 

Methods of reducing the overall cost of a HDTV 
decoder will now be described with reference to the 
ennbodiment of Fig. 1 . 

Generally, the single most expensive element of a 
video decoder, in terms of cost, is the frame memory 118 
which nrray comprise, e.g.. 16 MB of synchronous DRAM 
in a HD decoder. The coded data buffer 116. which is 
used for the temporary storage of the compressed bit- 
stream represents a smaller but not insignificant cost A 
fufly MPEG compliant HDTV decoder is expected to 
require at least 1 MB of RAM for use as a coded data 
Ixjffer. 

Other elemerrts of a decoder which add significantly 
to the cost of the decoder are the inverse discrete cosine 
transform circuit ("IDCT") 124 and the inverse quantizer 
("IQT circuit 122. The IDCT circuit 124 of a HDTV 
decoder is required to perform a large number of arith- 
metic conrputations at a high rate and therefore is likely 
to represent a significant portion of a decoder's drojitry. 
The IQ circuit 1 22 performs a smaller number of compu- 
tations than the IDCT circuit 1 24 but t>ecause of the high 
speed and corrplexity of the computations, the cost of 
the IQ circuit 122 may also be a significant component 
of a HDTV decoder's overall cost. 



In addition to the IDCT circuit 124 and the IQ circuit 
122 nx)tion conrpensation and interpolation logic may 
require a significant amount of logic gates, e.g.. the 
motion corrpensation and interpolation logic may require 

5 more logic gates than the IQ circuit to implement txjt less 
than the IDCT circuit. The syntax parser and variable 
length decoder circuit 120 may also require a significant 
amount of logic gates to implement 

Because the cost arxJ complexity of a HDTV 

10 decoder is largely a function of the requirement that it 
process large amounts of data on a real time basis, it is 
possit)le to reduce the complexity and thus the cost of a 
HDTV compatil)le decoder by reducing the amount of 
data that needs to be processed. While using only a 

15 small portion of the video data received in an HDTV sig- 
nal will result in reduced resolution and picture quality, 
by carefully selecting which HDTV data to process and 
the method by which it is processed, video image quality 
comparable to or better than SD television signals can 

20 be achieved. As will be discussed below, the preparser 
1 1 2 serves to dynamically limit the anrx>unt of video data 
supplied to the ren»ining elements of the decoder circuit 
100 including the syntax parser and VLD circuit 120 
thereby reducing the amount of data that must be proe- 
ms essed by the subsequent circuit elements on a real tin^ 
basis and the required corrplexity of those circuit ele- 
ments. An additional benefit of the use of the preparser 
112 is that it permits for the use of a smaller coded data 
buffer 116 than would olhenwse be required. 

30 As illustrated in Rg. 2, the preparser 1 1 2 has a first 
input for receiving the vsnakAe length encoded bitstream 
representing, e.g., a HDTV signal output by a transport 
demultiplexer/decoder circuit of a receiver, and a second 
input for receiving data rate control infornr^tion. The out- 

35 put of the preparser 1 12 is coupled to the input of the 
coded data buffer 116 which has a rate control output 
coupled to the second input of the preparser 1 1 2. 

The preparser 1 12 operates to reduce the amount 
of HDTV data sipplted to the coded data buffer. It does 

40 this by selectively discarding some of the received HDTV 
data. By using the preparser 1 12 to parse and discard 
data prior to variable length decoding it is posstt)le to 
operate the preparser 1 12 at adata rate that is a function 
of the data rate of the received bitstream. This rate will 

45 be considerably lower man the data rate that the corre- 
sponding circuitry wouW have to be capable of support- 
ing if it were located after the variable length decoder 
circuit 120. By preparsing the data prior to storing in the 
coded data buffer 116 and syntax parsing and variable 

50 length decoding, the need to support the worst case 
processing requirements that might otherwise be 
encountered by the SP and VLD circuit 120 is avoided. 
Sir>ce the nrmimum data rate. e.g., the wor^ case data 
rate, that the SP and VLD drcwt 120 would normally 

55 have to be designed to support can t>e as much as 60 
times higher than the average data rate, a substantial 
reduction in the complexity of the SP and VLD circuit 120 
can be achieved by using the preparser 112 to insure 
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that support for what would normally be the worst case 
variable length decoding scenario need not be provided. 

The preparser 112 parses the incoming bitstream. 
without performing a complete variable length decode 
operation, to identify data corresponding to different 5 
types of video frames, such as bi-directionally coded ("B- 
") frames, predictively coded video frames ("P-") frames 
and intra-coded ("I-") frames, and to identify MPEG cod- 
ing elements such as maaoblocks including motion vec- 
tors and OCT coefficients associated with each of the io 
identified frames in the received data stream. 

The preparser 1 12 performs several data reduction 
operations on a, HDTV bitstream as required to insure 
that the processing capacity of the SP and VLD circuit 
120 and throughout capability of the coded data buffer is 
1 1 6 are not exceeded. Thus, the preparser 1 1 2 operates 
to insure that the preparser output data rate stays within 
a real time decoding envelope established by the decod- 
ing limits of the other components of the decoder 100. 
By reducing the data rate as required, the preparser 112 20 
also serves to reduce the coded data buffer size require- 
ments because the coded data buffer 116 need only be 
capable of storing a reduced anxjunt of data. 

In one en±KXiiment. to insure that the preparser out- 
put data rate stays within the preselected limits of the 25 
decoding envelope, the preparser 1 12 limits the number 
of OCT coefficients used to represent a macroblock to a 
preselected nunrtjer and also limits the nunrtoer of bits 
per inacroblock by removing the highest order DCT coef- 
ficients associated with a received macroblock until the so 
desired nun*>er of bits per macroblock required to pre- 
vent the coded data buffer capacity and decoding limits 
from being exceeded is achieved. Both the number of 
DCT coefficients per macroblock and the number of bits 
per macroblock can be preselected to approximately 3S 
achieve the desired data rate or can be dynamically 
selected by the preparser 112. 

Thus, in one embodiment, to limit its output data rate 
to a level required to prevent the coded data buffer 116 
from overflowing, the preparser 1 1 2 limits the number of 40 
DGT coefficients which are used to represent each mac- 
roblock by discarding the higher frequency DCT coeffi- 
cients which exceed a preselected number of DCT 
coefficients. In such an enrtKxSment renx)val of AC DCT 
coefficients may be performed preferentially on B- 45 
frames, then P- frames and then I- frames, with a pref- 
erence for retaining data in earlier P- frames as opposed 
to later P- frames as measured from the most recently 
received I- frame, as required to achieve the desired 
reduction in the preparser output data rate. so 

In DCT based video compression systems the DCT 
coefficients are naturally ordered in two dimensions 
according to vertical and horizontal frequency bins. In 
typical systems the coefficients are assigned a one- 
dimensional ordering to allow for sequential transmis- 55 
sion. MPEG allows for two different orderings, generally 
called zig-zag scan and alternate scan. 

It may be the case that the subset of DCT coeffi- 
cients which are preserved, i.e., the DC DCT coefficients 



and lower frequency AC DCT coefficients not discarded 
by the preparser 1 12 may not be consistent with any lin- 
ear range of coefficients in either of the MPEG scanning 
types. In such a case, it is difficult to limit the DCT coef- 
ficients to the desired set by discarding all of the coded 
coefficients of a macroblock after a certain point in the 
data stream- 
In accordance with the present invention, two differ- 
ent approaches are used to insure that the preparser 1 1 2 
discards the preselected DCT coefficients. In accord- 
ance with a first embodiment, the preparser 1 1 2 chooses 
the coefficient index in scan order after which to discard 
coefficients- In this embodiment the retained set of DCT 
coefficients will initially include all coefficients which do 
belong to the desired set as well as some coefficients 
which do not betong to the desired set. The preparser 
1 12 then sends only the DCT coefficients which belong 
to the desired set to the coded data buffer 116 to be 
passed to the SP and VLD circuit 120. 

In a second embodiment, the preparser 1 1 2 recedes 
the incoming bitstream. This is done by removing run 
length/anplitude codes which refer to urKlesired coeffi- 
cients, and by then receding the later desired coefficients 
with suitably modified run lengths. 

By limiting the number of DCT coefficients per vnac- 
roblock, the preparser 112 effectively discards run- 
length/anrplitude symbols which effectively cause zero 
DCT coefficient values thus allowing for a simpler IpCT 
circuit 124 to be used, in some embodiments; as 
opposed to the IDCT circuit that woukJ be required to fully 
decode a HDTV bitstream. 

The discarding of run-length/amplitude syrrbols 
also allows for a simpler syntax parser and variable 
length decoder circuit 120 than woukJ other wise be 
required because the use of the preparser 112 insures 
that no nrore than the preselected number of run- 
length/amplitude pairs per macroblock will be output by 
the preparser 1 12 for further processing. 

Yet another benefit of the discarding of some DCT 
coefficients, i.e.. the effective setting of such coefficients 
to zero, is that the preparser 1 12 effects lowpass f iltering 
on a macrobtock t«sis thereby reducing or eliminating 
the need for additional filtering before a downsampling 
operation, which will be descr4>ed l>elow. is performed. 

In one particular embodiment the preparser 112 
also operates to detect and eliminate data from the 
received bitstream which will have no impact on ultimate 
pkrture quality. In one such embodiment, the preparser 
1 12 deletes from the data stream such MPEG data as 
bit slice stuffing and start code zero stuffing to thereby 
reduce the data rate without irrpacting picture quality. By 
eliminating such winecessary data, the preparser output 
data rate and the size of the coded data buff©- 1 1 6 can 
be reduced in this manner with no impact on picture qual- 

In one embodiment of the present invention, the pre- 
parser 1 12 dynamically adjusts the amount of data dis- 
carding performed as a function of the amount of 
memory available in the coded data buffer 11 6 for storing 
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the received data. In order to facilitate such dynamic data 
discarding the coded data buffer 1 1 6 includes a data rate 
rronitoring circuit which generates a signal SL^Iied to 
the second input of the preparser 112 indicating how 
much data is stored at any given time in the coded data 
buffer 116. 

In such an embodiment, the preparser 112 main- 
tains a model of coded data buffer 1 16 occupancy and 
works to ensure that the buffer 1 16 will never overflow. 
As buffer occupancy inaeases as indicated by the signal 
received from the buffer 1 1 6, the preparser 1 1 2 becomes 
more severe in the anrx>unt of high frequency information 
that is discarded further reducing the number of OCT 
coefficients per macroWock from a preselected n^i- 
nruim number to a lower number. 

The preparser 112 can look to buffer occupancy 
characteristics of the received bitstream. e.g., the man- 
ner in which the video data was originally encoded for 
data transmission, and use this information as a factor 
in determining wtiat information should be discarded to 
achieve the required reduction in the data rate. 

As one feature of the dynamic control of its output 
data rate and thus buffer control, the preparser 1 1 2 com- 
putes and stores the number of bits used to code each 
row of macrobiocks that comprise a picture as the data 
is received. While this number may vary dramatically 
from row to row and picture to picture, since the decoder 
will take roughly the same anxHint of time to process 
each row or macrobiocks regardless of the number of 
bits used to represent the row of macrobiocks. the pre- 
parser can predict the rate of code buffer emptying. 
When this rate is expected to be relatively fast. e.g.. 
because a large number of bits are used to represent the 
preceding rows of macrobiocks, the preparser 112 can 
discard less data from the received data stream without 
concern of the coded data buffer 116 overflowing. How- 
ever, when relatively few bits are used to code the mac- 
robiocks represerrting the preceding rows of 
macrobiocks. the coded data Ixjffer 116 will empty on a 
conparatively sk)w basis requiring the preparser 1 12 to 
discard more data from the received data stream until 
the coded data buffer 116 becomes less occupied. 

In one embodiment, to minimize implementation 
costs, the preparser 1 12. which may also be referred to 
as a bitstream preprocessor, is designed to run at the 
incoming bit rate and is implemented using a bit serial 
architecture. Alternatively, the preparser 1 12 can parse 
an entire token at once, and at a rate that never needs 
to exceed the data rate of the received variable length 
encoded bitstream divided by the minimum length of a 
symbol in the bitstream. 

As will be discussed below, when the decoder circuit 
100 is used as a joint reduced cost HDTV decoder and 
SDTV decoder, the data throughput is selected to permit 
the passing of a full rate SDTV tntstream. 

The coded data buffer 116. which is coupled to the 
preparser 1 12. in the above described manner, is used 
to tenporarily store the variable length encoded data 
output by the preparser 112 urrtil the synteix parser and 



variable length decoding circuit 120 is ready to accept 
the video data for processing. 

The syntax p>arser arxl variable length decoder cir- 
cuit 120 has an input coupled to a data output of the 

£ coded data buffer 1 16 to receive therefrom the variable 
length encoded data temporarily stored therein. The syn- 
tax parser and variable length decoder ("SP and VLD") 
circuit 120 decodes the received data to a constant 
length bit stream. 

10 By using the preparser 1 12 in the above described 
manner, the conrputation requirements of the SP arxJ 
VLD circuit 120 are substantially reduced as compared 
to the case where all the received data is syntax parsed 
and variable length decoded. The preparser 1 12 effec- 

15 tively limits the number of DCT coefficients which must 
be variable length decoded for any given macroWock by 
discarding coefficients exceeding the preselected maxi- 
mum number permitted per macroWock thus reducing 
the burden on the real time SP and VLD circuit 120 by 

20 putting a dear limit on the amount of data that will have 
to be processed for each macroWock. 

The output of the SP and VLD circuit 120 is coupled 
to the input of the inverse quantizatfon ("IQ*) circuit 122 
which, in turn, has an output coupled to the input of the 

25 IDCT circuit 124. 

Reduced complexity IQ and IDCT circuits 122 and 
124 are implemented in accordance with the present 
invention by intentionally setting preselected AC DCT 
coefficients corresponding to each macroWock to zero. 

30 The IQ circuit 1 22 can be designed to preform this func- 
tion. 

The IQ circuit 122 receives the data output by the 
SP and VLD circuit 120 and operates to generate DCT 
values of zero for preselected DCT coefficients regard- 

35 less of the input value of the preselected DCT coeffi- 
cients. In this nrianner. the IQ circuit need not actually 
process the preselected DCT coefficients received at its 
input but only the remaining DCT coefficients. 

In one exemplary embodiment, the IQ circuit 122 

40 sets all but the ipper left 4X4 region of DCT coefficients 
of a set of 8X8 DCT coefffciertts representing a HDTV 
macroWock to zero. Fig. 3A illustrates the resulting effect 
on an 8x8 Wock of OCT coefficients representing a mac- 
roWock. In the illustration of Fig. 3. Xs are used to repre- 

45 sent DCT values that are actually processed by the IQ 
circuit 122. whUe as Ulustrated all the other DCT values 
are set to zero. 

Accordingly, in such an enrfoodiment the IQ circuit 
122 need only include circuity for performing inverse 

50 quantization on 16 of the possiWe 64 DCT coefficients 
corresponding to each macroWock and circuitry for out- 
putting zeros for the remaining DCT coefffoient values. 
In this manner, the rwerse quantization computation 
requirement is reduced by a factor of four. The complexity 

55 of the IDCT droit 124 \s reduced in a similar manner 
since a preselected number of the DCT values will be set 
to zero by the IQ circuit 122. 

MPEG permits the use of either interfaced pictures 
or non-interlaced pictures. When encoding video data as 
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interlaced pictures, the OCT operation may have been 
performed on either a field or frame OCT basis. When 
interlaced video data is encoded on a frame DCT tjasis 
the high frequency coefficients represent temporal as 
well as spatial information. In such a case it may be desir- 
able to maintain high vertical frequencies to maximize 
the presen/ation of temporal information. 

Accordingly, in one exemplary embodiment in which 
it is expected that some interlaced pictures encoded on 
a frame DCT basis will be received, the IQ circuit 120 is 
designed to force to zero all DCT coefficients except the 
DCT coefficients corresponding to rows 0, 1 . 6 and 7 and 
columns 0. 1 . 2. 3 of an 8x8 DCT coefficient two dimen- 
sional an-ay representing a HDTV macroblock. Such a 
case is illustrated in Fig. 38 where Xs are used to repre- 
sent DCT coefficient values that are processed by the IQ 
circuit 122 and zeros are used to represent the prese- 
lected DCT coefficients that are set of zero. 

It should be noted that it is possible to select DCT 
coefficients other than those illustrated in the embodi- 
merrts of Figs. 3 A and 3B to be set to zero depending on 
the desired innplementation. 

The IDCT circuit 1 24 receives the data output by the 
inverse quantization circuit 122 and performs an inverse 
DCT operation on the received data. Because, in accord- 
ance with the present invention, preselected DCT coef- 
ficients were intentionally set to zero, the inverse DCT 
circuit need not include circuitry for processing those 
coefficients. Thus, as with the inverse quantization circuit 
122. the IDCT circuit 124 can be implemented using a 
relatively simple circuit. 

The output of the IDCT circuit 124 is coupled to the 
irput of the downsampler 126. The downsanpler 126 is 
used to downsample the data corresponding to each pic- 
ture prior to storage in the frame buffer 1 18. As a result 
of the downsampling operation, the amount of data 
required to represent a video frame, is substantially 
reduced. For example if the downsampler 126 is inple- 
mented to remove half of the digital samples used to rep- 
resent a picture, the annount of data that would have to 
be stored will be reduced by a factor of approximately 
two substantiaHy reducing the amount of memory 
required to implement the frame buffer 1 18. 

Referring now briefly to Fig. 2A there is illustrated 
another enr±K>diment of the decoder according to the 
present invention as indicated by the reference numeral 
200. Like numbered elements in Figs. 1 and 2A are the 
same or similar to each other and thus will not be 
described again in detail. In the decoder 200, a small 
buffer 202. e.g.. a Ixrffer c^sable of storing the data cor- 
responding to several lines of a video frame, in combi- 
nation with a low. pass filter 204. is used to couple the 
output of the IDCT circuit 124 to the input of the down- 
sarrpler 1 26. The buffer 202 stores video data represent- 
ing se/eral scan lines of a picture which is then low pass 
filtered by the low pass filter 204 prior to downsampling 
to reduce the blockiness in the image that might other- 
wise result from the downsampling operation. 



Referring again to Fig. 1 it can be seen that the out- 
put of the downsampler 1 26 is coipled to a first input of 
the summer 128 and a first input of the switch 129. A 
second input of the summer 1 28 is coupled to the output 

5 of the MCP circuit 130 which supplies motion compen- 
sation information to the summer 128 to be combined 
with received P- and B- video frames. The video data 
output by the summer 128 represents a video frame that 
has been processed using nrration compensation data to 

10 include the relevant video information from preceding 
and/or sii>sequent anchor frames. The output of the 
downsampler 126, on the other hand, represents either 
fully decoded picture data or decoded prediction residu- 
als in the cases of intraframe coded nr^croblocks and 

,5 non-interframe coded maaoblocks respectively. The 
switch 129 is controlled, as is known in the art. to selec- 
tively couple the output of the downsampler 126 to the 
input of the frame buffer 1 18 when the received video 
macroblock was not encoded using motion compen- 

20 sated prediction, e.g.. when the video data output by the 
downsampler was intra-f rame coded. However, when the 
video data output by the downsampler 1 26 was encoded 
using nation compensated prediction, e.g.. when it is B- 
frame or P- frame data, the output of the sunrvner 128 is 

2s coupled by the switch 1 29 to the input of the frame mem- 
ory 118- 

The received downsampled, decompressed video 
frames are stored in the frame memory 118. The frame 
memory 1 1 8 has a first output coupled to an input of the 
30 MCP circuit 1 30 and a second output coupled to an input 
of the post processor circuit 1 36. In tNs manner the MCP 
circuit is supplied with downsampled video frames to be 
used as anchor frames for motion compensation pur- 
poses. 

35 Refen^ing again briefly to Fig, 2A. it can be seen that 
the decoder 200 contains additional circuitry for reducing 
the amount of data required to be stored by the frame 
buffer 118. i.e.. a least significant brt TLSB") truncation 
circuit 208. The LS8 circuit 208 couples the output of the 
40 switch 129 to the input of the frame buffer 1 18. MPEG 
assumes that 8 bits will be used per pel when represent- 
ing a video frame. In accordance with the present inven- 
tion, in order to reduce frame buffer memory 
requirements, one or more least significant bits are trun- 
ks cated from each set of bits representing a pel . For. exam- 
ple, in one embodiment, the LSB truncation circuit 208 
truncates the data representing a pel from 8 bits to e.g.. 
6 or 7 bits. When the data corresponding to each pel is 
truncated to 6 bits a data savings of approximately 25% 
so results. 

The amount of truncation performed may t>e imple- 
mented as a function of the type of frame to wfiich the 
data corresponds. Because reduction in the resolution 
in anchor frames will impact the quality of fran^ which 
55 use these frames for motion compensation purposes, it 
may be desirable to reduce the resolution of these 
frames less than frames, such as B- frames, which do 
not irrpact the quality of other frames. Accordingly, in one 
entxxliment anchor frame data. e.g.. I- and P- video 
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frame data, is truncated to 7 bits per pel while data which 
is not used for anchor frames, e.g.. B- frame data, is trun- 
cated to 5 bits perpel- 

When LSB truncation is used to reduce frame buffer 
memory requirements, in order to maintain MPEG com- 
patibility it is necessary to represent the stored data in 
an 8 bit per pel format. Accordingly, the outputs of the 
frame buffer 1 18 are coupled to least significant bit pad- 
ding circuits 206. 207 which pad the least signrticant bits 
of the data output by the frame buffer 1 18 with zeros to 
insure that the data con-esponding to each pel of a frame 
is represented by 8 bits. The output of the LSB padding 
circuit 206 is coupled to the input of the MCP circuit 
thereby providing the MCP circuit with downsampled 
video frame data. The output of the LSB padding circuit 
207 is coupled to the input of an optional low pass filter 
209 which, in turn, is coupled to the post processing cir- 
cuit 136. The optional low pass filter 209 reduces the 
effect of the quantization noise generated as a result of 
the truncation arxJ padding process but also results in a 
slight reduction in the resolution of the video frame being 
supplied to the post processing circuit 136. 

The operation of the MCP circuit 130 illustrated in 
Ftgs- 1 and 2A. which is designed to operate using 
rrxjtion vectors generated in accordance with the MPEG 
standard, will now be desaibed. 

MPEG supports the use of motion compensation 
and, in particular. P- frames and B- frames each of which 
rely on arrother frame as an anchor frame. The motion 
vectors receive as part of the video data stream sup- 
plied to the preparser 112 were calculated at encoding 
time for use with full resolution video frames as opposed 
to downsampled video frames. 

In accordance with the present invention, the down- 
sanpled frames supplied by the frame buffer 1 18 to the 
MCP circuit 130 are upsampled. e-g., on-the-fly. interpo- 
lated and then downsampled prior to generating predic- 
tions based on the motion vectors. In this manner the 
rrxdtion vectors wtiich were originally generated based 
on full resolution video frames are effectively applied to 
dowr^mpled video frames. 

As illustrated ffi Fig. 1. the MCP circuit 130 conrv 
prises an upsampler 131 which has an input coupled to 
the first output of the frame buffer 118. and an output 
coupled to the input of a half pel interpolation circuit 1 32. 
The output of the hatf pel interpolation circuit 1 32 is cou- 
pled to the input of a motion compensated prediction 
module 1 35. The output of the motion compensated pre- 
diction rTKXjule 135 is coupled to a downsampler 133 
which, in turn, has an output coupled to the input of the 
average precfictions circuit 134. The output of the aver- 
age predictions circuit 134. which serves as the output 
of the MCP circuft 130. is coupled to the second input of 
the summer 128. 

Various known upsampling and down sampling fil- 
ters may be used for the upsampier 131 and downsam- 
pler 133. respectively. However, for best results, 
simulations have shown that it is very important that the 
downsarrpler 133 use a downsampling process which is 



the exact inverse of the upsampling process since even 
small differences are made noticeatMe in inter-coded 
video frames after many generations of predictions- 
There are several ways the required upsampling and 

5 downsampling can t>e accomplished. For example, in 
one enrbodiment a bilinear upsampling filter with a 3x3 
kernel and a center coefficient = 1 .0 is used as the 
upsampler 131 and a downsampler which does not per- 
form filtering is used as the downsampler 133. This 

10 selection of filters allows for reasonatrfy good upsam- 
pling with no additional t>lurring during downsamplir>g. 
This filter arrangement works especially well when the 
residual image to be accumulated during predication has 
been lowpass filtered prior to accumulatior^. Such low 

15 pass filtering is. as discussed above, achieved to some 
extent as a result of the preparser 112 removing higher 
order AC OCT coefTidents from nnacroblocks which 
include more than the preselected maximum ruimber of 
DCT coefficients per macroWock. 

20 In another embodiment an upsampling fitter with a 
kernel size of 2x2 pixels is used as the upsampler 131 
and a downsampling fitter is used with tiie same kernel 
size as the upsampling fitter and a transfer function such 
that the convolution of the upsarrpling filter transfer fi»K- 

25 tion and the downsampling filter ti^r^r function pro- 
duces a 3x3 kernel whose center coefficient is 1 .0. 

While it is preferred that identical downsampling cir- 
cuits be used for the downsampler 126, 123 there is no 
requirement that this in fact be the case. However, it is 

30 irrportant tiiat the downsampler 133 be matched to the 
upsanpler 131, as discussed above, to provide good 
results. 

MPEG permits the portion of an image correspond- 
ing to a maaobkx:k to be encoded eittier on an interlaced 

35 or non-interlaced manner. Macroblocks corresponcfing 
to non-interlaced pictures are always coded using a 
frame OCT format and with macroblocks conresponding 
to interlaced pkrtures b^ng coded using either a fieW or 
frame DCT format 

40 In accordance with the present invention, downsam- 
pling is performed separately on each individual macrob- 
lock. Downsampling of macrobtocks corresponding to 
non-interlaced pkrtures is performed by the downsam- 
pler 126 on a frame basis. 

45 However, with regard to macrobtocks corresponding 
to interlaced pictures downsampling can be perfomried 
on either a fiekl or frame basis for each macrobtock. 

As discussed above, the downsampler 1 26 performs 
downsarr^ing on norvinterlaced pkrtures on a frame 

50 basis. However, with regard to interiaced pictures it can 
be performed on either a f ieW or frame basis. Thus, in 
tiie case of macrobtocks of interlaced ptotures one is 
confronted with a deciston whether the downsampling. 
performed by the downsanplers 126, 133, should be 

55 done on a field or on a frame basis. Field based down- 
sampling preserves the greatest degree of temporal res- 
olution whereas frame based downsarrpling has the 
potential of preserving the greatest degree of spatial res- 
olution. In one enrtxximent of the preserrt invention pic- 
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tures are downsampled in a consistent manner either on 
a field or frame basis when interlaced pictures are 
received. Accordingly, in one ent)odiment both inter- 
laced and non-interlaced pictures are downsampled on 
a frame basis. While in another embodiment non-inter- 
laced pictures are downsampled on a frame basis while 
interlaced pictures are downsampled on a field basis. 

In another slightly more elaborate embodiment, the 
downsampler 126 performs downsampling on a inter- 
laced pictures on both a field and frame basis with the 
particular method depending on the individual macrob- 
lock being processed. 

Field based downsampling is most appropriate 
when there is not much high frequency content in the 
macroblock being downsampled and/or there is a great 
deal of motion associated with the macroblock. On the 
other hand, frame based downsampling is most appro- 
priate when there is significant high frequency content in 
the macroblock being processed and/or little motion 
associated with the macroblock- Accordingly, in one 
en^Dodiment. the downsampler 126 assesses the 
desaibed characteristics for each macroblock of an 
interlaced picture and dynamically determines the most 
appropriate method for downsampling the individual 
macroblock. 

In yet another embodiment which supports down- 
sanpling on both a field and frame basis but which 
requires relatively little circuitry for deciding which down 
sanpling approach to use. the downsampler 126 merely 
follows the choice made by the encoder when it originally 
encoded the macroblock being downsampled since the 
criteria used for previously deciding whether to encode 
the macroblock on a field or frame OCT basis is similar 
to the criteria which should be used to decide v^rtiether 
to downsarrple on a field or frame basis. 

Accordingly, in one embodiment of the present 
invention the downsampler 126 downsamples macrob- 
locks corresponding to an interlaced picture on a frame 
basis if they were encoded on a frame DCT b^ls and 
downsamples them on a field basis if they were encoded 
on a field DCT basis. 

While the downsampling of frames prior to storage 
in the frame buffer 114 reduces frame buffer memory 
requirements, it has the disadvantage of requiring the 
use of the upsampler 131. half pel interpolation circuit 
1 32 and downsampler 1 33 in order to take full advantage 
of the motion vectors. Such circuitry slightly increases 
the processing requirernents as compared to those of 
standard MCP circuits 15 since the anchor picture data 
must be upsarrpled prior to irrterpolation if half-pel res- 
olution is to be achieved. It should be noted that as an 
alternative to upsampling. interpolation, and then down- 
sampling, a less complicated and thus less costly way to 
perform motion conpensation, with somewhat lower 
quality results, is to divide the value of each original 
motion vector by the integer decease in resolution and 
take the average integer result of the division operation 
as the new motion vector value. 



While downsampling thus tends to increase MCP 
processing requirements slightly it has the advantage of 
reducing the conplexity of the input/output circuitry to the 
MCP circuit because the data bandwidth into and out of 
5 the MCP circuit 1 30 is reduced as a resurt of the down- 
sampling by the same factor as the buffer frame memory 
requirements. 

In accordance with one embodiment of the present 
invention, in order to reduce the processing and band- 
70 width requirements of the MCP circuit 1 30. and thus the 
complexity of the MCP circurt 130, bi-directional and 
dual-prime prediction is avoided. Thus eliminating the 
need for the average predictions circuit on the ability to 
process to anchor frames simultaneously Once a picture 
15 has been upsampled by the upsampler 131 interpolation 
is preformed by the half pel interpolatk>n circurt- The 
fTiotion compensated prediction rrxxjule then actually 
applies the motion vectors to the upsampled picture 
which is then downsampled by the downsarrpler 133. 
20 When the picture being processed is a bi-directionally 
coded picture, the upsampling interpolation, motion and 
prediction operations will be performing on two anchor 
frames which are then averaged together by the average 
predictions circuit to generate the single picture output 
25 by the MCP circuit 1 35. 

This is accomplished, in accordance with one 
enixxiiment of the present inventk>n by i^ng only a sin- 
gle reference or anchor picture for prediction purposes 
even if the macroblock type flag associated with a mac- 
30 roblock being processed indicates that rt is a bi-direc- 
tional coded wacrctikxk, i.e.. a macroblock that was 
coded based on the content of two different pictures. 

In such an errtxxliment when the bi<lirectionally 
coded macroblocks are received, the MCP circuit 130 
35 performs motion compensation using only one reference 
despite the fact that the macrobock type flag associated 
with the individual macroblock being processed in the 
data stream indicates that motion compensation shoukJ 
be performed using two references. 
40 In an exemplary embodiment, the i^>sampler circuit 
131 detects when a bi-directionally or dual coded mac- 
roblock is being processed, selects which of the two pos- 
sible anchor pictures associated with the macroblock to 
use for predication purposes and then upsamples only 
45 the selected anchor picture. 

In one embodiment the upsampler circuit 131 
selects which of the two possible anchor pictures to use 
for predication purposes by selecting the anchor picture 
that is temporally ctoset to the cun-ent picture and thus 
50 macrobk>ck being processed. If the two anchor pictures 
are tenporally equidistant, the average predictions cir- 
cuit 134 selects the earlier of the two possible anchor 
pictures to thereby select the one of the two anchor pic- 
tures which will have suffered the least degradatfon due 
55 to drift. 

Since errors do not propagate from B-pictures. 
because they are not used as anchors for subsequent 
pictures, a fairly large degree of tSstortion, i.e. . of the type 
caused by using only a single anchor picture for predic- 
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tion purposes, can be tolerated with regard to B-pictures. 
However, the use of a single anchor picture for prediction 
purposes with dual-prime coded sequences may result 
in somewhat less satisfactory results because prediction 
errors may propagate when dual-prime coded 
sequences are used. 

Referring now to Fig. 2B there is illustrated a video 
decoder circuit 300 according to the present invention 
capable of decoding HDTV signals at less than full HDTV 
resolution and SDTV signals at full SD resolution. As 
illustrated, the decoder circuit 300 is similar to the 
decoder circuit 100 but includes additional circuitry, i.e.. 
a HDTV/SDTV mode control circuit 310 for controlling 
the decoder circuit's mode of operation, a set of switches 
301, 302. 303 and a multiplexer 341 for bypassing the 
decoder's preparser 112. downsanpler 126. upsampler 
1 3 1 . and downsanrpler 1 33 when the HDTV/SDTV mode 
control circuit 310 determines that an SDTV signal is 
being received. When the HDTV/SDTV mode control cir- 
cuit 310 determines that a HDTV signal is being received 
it controls switches 301 , 302. 303 and multiplexer 341 to 
connect the circuit elements in generally the same man- 
ner as illustrated in the embodiment of Fig. 1 so that the 
data reduction features of the decoder circuit 300 of Fig. 
28 are operative. 

The HDTV/SDTS/ mode control circuit 310 receives 
as its input the variable length encoded bitstream. From 
analyzing the data in the bitstream. e.g.. header informa- 
tion indicating the resolution of the video frames being 
received, it determines if it is receiving HDTV data. i.e.. 
data of a higher resolution than it can decode withput 
performing data reduction operations on the received 
data, or SDTV data. i.e. data which the decoder 300 can 
fully decode and store without having to perform any data 
reduction operations. 

The HDTV/SDTV nxxie control circuit 310 gener- 
ates a mode control signal which is supplied to a position 
control input of the switch 301 which is part of the pre- 
parser module 31 2. a position control input of the swvitch 
302 which is part of a downsampler module 326 and a 
position control input of the switch 303 and multiplexer 
341 which is part of a motion vector interpolation module 
333. As discussed above, the output signal of the 
HDTV/SDTV mcxJe control circuit 310 operates to switch 
each of the modules 312. 326. and 333 between a first 
mode of operation. e.g.. a SDTV nrx)de of operation, to 
a second mode of operation, e g. . a HDTV nrxxle of oper- 
ation. In such a manner, the decoder 300 is controlled 
by the HDTV/SDTV nrxxJe control circuit to perfomi data 
reduction to reduce the resolution of the received frames 
during the second mode, i.e., the HDTV nxxJe of opera- 
tion, but not during the first mode of operation, i.e. the 
SDTV mode of operation. 

While the frame buffer 118. coded channel buffer 
116. IQ circuit 122. IDCT circuit 124. MCP circuit 330 
and SP arxJ VLD circuit 120 are generally the same in 
the decoder 300 as the decoder 1 00. various features of 
each of these decoder elements will be discussed as 
they relate to the particular embodiment illustrated in Fig. 



2B, e.g., when they are implemented as part of a decoder 
which is capable of decoding both a television signal of 
a first resolution and a second television which is of lower 
resolution than the first television signal, e.g.. HD and 
5 SD television signals. 

The preparser 1 12. in the embodiment illustrated in 
Fig. 2B. is designed to reduce the data rate of a received 
HDTV signal to approximately that of a SDTV signal. 
Thus, the preparser 112 lirrrts the number of run- 

10 length/amplitude symbols per coded macroWock and the 
nurrt)er of bits per macroWock so that the SP and VLD 
circuit 120 can function at approximately the same rate 
when processing HDTV signals as when processing 
SDTV signals, e.g., at the MP@ML specified maxirrujm 

15 data rate. 

This reduces the memory requirements for the 
coded data buffer 1 1 6 which serves as a channel buffer. 
Thus, the decoder 300 uses a channel buffer that meets 
the memory requirements for a SDTV decoder but which 

20 is smaller than that required for a fufly MPEG compliant 
HDTV decoder. U.S. ATV specifies a channel buffer of 8 
Mb. whereas orrfy 1.5 Mb are required to implen^ a 
MP@ML channel buffer. Because the MP@ML specifi- 
cation requires 830 Kbits more for a frame buffer than 

25 are required for decoding reduced resolution video pic- 
tures in accordance with one embodiment of the present 
invention, tNs 830 Kbits of the frame buffer which are not 
used for picture storage during HDTV mode operation 
can be used to supplement the memory of the channel 

30 buffer when operating in HDTV mode. Thus, in accord- 
ance with the present invention, the coded data buffer is 
inrplemented using approximately 2.33 Mbits which are 
also required to inrplement a full resolution SD decoder. 
It is the responsibility of the preparser 1 1 2 to insure 

35 that the coded channel buffer 1 1 6 does not overflow dur- 
ing HDTV mode operation. This is achieved by. e.g.. the 
preparser 112 discarding the less useful DCT coeffi- 
cients in order to prevent the smaller than full size HDTV 
channel buffer 1 16 from overflowing. 

40 While, the SP and VLD unit 1 20 is designed to oper- 
ate at the same general data rate as a SDTV data rate, 
it is also designed to accommodate the larger rar^ge of 
parameter values allowable for HDTV coding, as com- 
pared to SDTV decoding, e.g.. frame rate, picture hori- 

45 zontal size and picture vertical size. Thus, the SP and 
VLD circuit 120. because of the larger range of parame- 
ter values requffed to support both HDTV and SDTV 
decoding, is slightly more complicated then a standard 
SDTV syntax parser and variable length decoder circuit 

50 in this regard. 

As described above, the output of the syntax parser 
1 20 is coupled to the input of the IQ circuit 1 22 which, in 
turn, is coupled to the IDCT circuit 124. The IQ and IDCT 
circuits 122. 124 of the decoder 300 wiO now be 

55 described. 

MPEG MP@ML allows for pixel rates of up to 10.4 
million pixels per second. US. ATV allows for pixel rates 
of up to 62.2 mtllion pixels per second. 
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remove OCT coefficients from each irxiividual mac- 
roblock until the number of bits used to represent the 
individual macroblock does not exceed a predeter- 
mined maximum number of bits. 

5 

3. The decoder of daim 1 . wherein the channel buffer 
is further coupled to the preparser by a feedback line 
and wherein the channel buffer signals the preparser 
to control the output data rate of the preparser as a 
function of the amount of data stored in the channel 'O 
buffer. 

4. The decoder of claim 1 . further comprising 

a syntax parser and variable length decoder 
circuit coupled to the channel buffer; and '5 

an inverse quantization circuit coupled to the 
syntax parser and variable length decoder circuit for 
inverse quantizing only a limited preselected 
number of OCT coefficients per macroblock and for 
outputting zeros corresponding to the OCT coefffi- 20 
dents of the macroblock which are not inverse quan- 
tized, wherein the limited number of OCT 
coefficients per macroblock is less than a maximum 
• possible nunrtoer of OCT coefficients per macrob- 
lock. 

5. The decoder, circuit of claim 4. wherein the limited 
preselected number of OCT coefficients per mac- 
roblock is less than 11 and wherein the nnaximum 
possible nunt>er of OCT coefficients per macroblock 30 

is 64. . 

6. The decoder circuit of claim 4, further comprising an 
inverse. DCt circuit coupled to the inverse quantizer 
circuit for performing an inverse DCT operation on 35 
the lirnited preselected number of DCT coeffidents 
per macroblock. 



40 



7, The decoder circuit of claim 1 . further connprising: 
a syntax parser and variable length decoder 
circuit coupled to the channel buffer; 

a downsampler coupled to the syntax parser 
and variable length decoder circuit for downsam- 
pling the video pictures output by the syntax parser 
to ther^y reduce the anxxint of resolution of each 45 
picture and the amount of video data used to r^re- 
sent each video picture. 



50 



8. The decoder circuit of daim 7. wherein the pictures 
further include motion vectors, and wherein the 
decoder circuit further comprises: 

a frame buffer coupled to the downsampler 
for storing the reduced resolution pictures generated 
by the downsampler; and 

a nxrtion corrpensation drcuit coupled to the 5S 
frame buffeir. the motion compensation drcuit indud- 
ing: 



i. an upsampler circuit for upsampling the 
reduced resolution pictures to generate predic- 
tion reference pictures; 

ii. an interpolation circuit for interpolating the 
predication reference pictures; 

iii. a motion compensated prediction module tor 
applying the motion vectors to the prediction ref- 
erence pictures to generate new pictures; and 

iv. a downsampler drcuit coupled to the motion 
compensated prediction module for downsam- 
pling the new pictures to the same resolution as 
the pictures stored in the frame buffer. 

9. A method of decoding variable length encoded dig- 
ital video data including pictures, to a resolution less 
than the full resolution of the pictures, each picture 
including a plurality of macroWocks. each maaob- 
lock including a plurality of OCT coeffidents, the 
method comprising the steps of: 

receiving digital video data; 

parsing the digital video data to identify mac- 
roblocks induded in the digital video data arxl the 
DCT coefficients induded in the identified macrob- 
locks; 

discarding from the digital video data the DCT 
coefficients included in each macroblock that 
exceed a preselected nunrtoer of DCT coeffkaents; 
and 

storing the digital vkJeo data in a coded data 

buffer. 

1 0. The decoding method of daim 9, wherein each DCT 
coefficient is represented by a number of bits and 
wherein each macroblock is represented by the total 
number of bits representing the DCT coefficients 
included in the macrobiocK the method further com- 
prising the step of: 

prior to storing the digital video data in a 
coded data buffer, discarding DCT coeff iciertts from 
each identified macroblock until each identified mac- 
roblock is represented by a total nun^er of bits that 
is less than a preselected maxinuim number of bits. 

1 1 . The method of claim 1 0. further comprising the steps 
of: 

monitoring the amount of data stored in the 
coded data buffer to determine coded data buffer 
occupancy; and 

dynamidy adjusting the total number of bits 
aikwed per macrobkx^t from me preselected maxi- 
mum number to a smaller number as coded data 
buffer occupancy increases to insure that the coded 
data buffer does not overf tow. 

1 2. The method of claim 9, further comprising the steps 
of: 

syntax parsing and varial5le length decoding 
the video data stored in the coded data buffer to gen- 
erate a variable length decoded video data stream 
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including macrobiocks. each macroblock including a 
plurality of OCT coefficients; and 

inverse quantizing a preselected number of 
DCT coefficients included in each macroblock of the 
variable length decoded video stream and output- 
ting zero values corresponding to the maaoblock 
DCT coefficients which are not inverse quantized. 

1 3. The method of claim 1 2. further comprising ttie step 
of: 

performing an irrverse discrete cosine trans- 
form operation on the inverse quantized video data. 

1 4. The method of claim 1 3, further comprising the steps 
of: 

downsampling the video data to reduce the 
resolution of the inverse quantized video data and 
to reduce the amount of menx)ry required to store 
the inverse quantized video data; and 

storing the video data in a frame buffer. 

1 5. The metfxxi of claim 1 4. further comprising the steps 
of: 

Lpsarrpling the video data stored in the frame 

buffer: 

interpolating the upsampled video data; per- 
forming motion compensated prediction on the 
upsampled video data; and 

downsampling the upsampied video data 
upon which motion compensated prediction was 
performed. 

16. The method of claim 15, wherein the downsampling 
operation performed on the video data after perform- 
ing nrxjtion connpensated prediction is an exact 
inverse of the upsampling performed prior to per- 
forming motion compensated prediction. 

17. A method of decoding digital data representing a 
video frame, the digital data representing each video 
frame including video data representing a plurality 
of pixels, the video data representing each pixel 
including a set of bits, the method comprising the 
step of: 

truncating the least significant bit from each 
set of iDits representing one of the individual pixels; 
and 

storing the truncated set of bits representing 
each one of the individual pixels in a frame txrffer. 

1 8. The method of daim 1 7. further comprising the steps 
of: 

reading data out of the frame buffer; and 
£q3pending a zero to each set of bits repre- 
senting one of the incSvidual pixels as it is read out 
of the frame txiffer. 

19. The method of daim 18. further comprising the step 
of: 



low pass filtering the data read out of the 
frame buffer. 

20. A digital picture-in-picture decoder circuit for decod- 

s ing a primary picture to be displayed in a primary 

picture area and a second picture to be displayed in 
a secondary picture area, the secondary picture 
area being smaller than the primary picture area, the 
decoder drcuit comprising: 

w a full resdution video decoder for receiving 

and decoding a first set of digital video data induding 
the primary picture; 

a first reduced resolution video decoder for 
receiving a second set of digital video data including 

75 the second picture and for decoding the received 
video data to gen^e a reduced resolution version 
of the second picture; and 

a post processor circuit coupled to the full res- 
olution decode circuit and the reduced resdution 

20 decoder drcuit, the post processor circuit operating 
to combine the primary full resolution picture and the 
reduced resolution version of the second picture irrto 
a single picture 

25 21. The digital picture-in-picture decoder drcuit of daim 

20. wherein the fiil resolution video decoder is a 
HDTV decoder and wherein the reduced resolution 
video decoder is a reduced resdution video decoder 
that generates pidures 1/4 x 1/4 the size of the fiil 

30 resdution video decoder. 

22. The digital picture-in-picture decoder drcuit of daim 

21 . furth^ comprising a second reduced resdution 
video decoder coupled to the postprocessing drcuit 

35 for decoding a reAiced resdution version of a ttiird 

picture; and 

wherein the post processor drcuit operates 

to confine the reduced resdution version of the 

third picture with the first full resdution picture and 
40 the reduced resolution version of the second pictwe 

to form the single picture. 

23. The digital picture-in-picture decoder drcuit of daim 
20, wherein the second set of digital video data 

45 indudes a plurality off macrobiocks. each maaob- 
lock induding a plurality of DCT coeffidents; and 

wherein the frst reduced resolution video 
decoder conrprises: 

preparser means for receiving the second set 
50 of digital video data and for removing from each 
macroblock DCT coeffidents that exceed a prese- 
leded maxinuim number of DCT coeffidents per 
macroblock; and 

a charwiel buffer coi^led to the preparser 
55 means for temporarily storing the video data output 
by the preparser means. 

24. The digital picture-in-picture decoder drcuit of daim 
23. wherein the first reduced resolution video 
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In accordance with the present invention. IQ circuitry 
which meets MP@ML IQ requirements is used for the IQ 
circuit 122 of the decoder 300. In such an embodiment, 
the IQ circuit 1 22 is designed to process all the OCT coef- 
ficients of a SDTV macroblock but only. e.g.. 10 or 11 
DCT coefficients of an HDTV macroblock. 

That is. the IQ circuit 122 is designed to set a prese- 
lected group of DCT coefficients of a HDTV macroblock 
to 0 while processing the remaining 10 or 1 1 DCT coef- 
ficients of each HDTV macroblock being processed. 

In one exemplary embodiment, for HDTV macrob- 
locks coded using field DCTs. the DCT coefficients of an 
8x8 macroblock are discarded by the IQ circuit 122 as 
illustrated in Fig. 3C where zeros are used to represent 
discarded DCT coefficients and X's are used to repre- 
sent DCT coeff icients which are processed by the IQ cir- 
cuit 122. For HDTV macroblocks coded using frame 
DCTs. the DCT coefficients of an 8x8 macroblock are 
discarded by the IQ circuit 122 as illustrated in Fig. 3D. 

The I DCT circuit 124. like the IQ circuit 122 is 
designed to processes only a small percentage of the 
DCT coefficients associated with a HDTV macroblock. 
e.g.. the same DCT coefficients processed by the IQ cir- 
cuit122. 

An IDCT engine capable of performing fast one 
dimensional 8 point IDCTs can be used to implement the 
2-D ibCT operation required by the IDCT circuit 124. A 
standard MP@ML IDCT engine can be used to compute 
about three 8 point IDCTs in the time allowed for IDCT 
processing of one HDTV macroblock. Accordingly, in one 
errtodimeht of the present invention, only DCTs in the 
first three columns of an HDTV macroblock are proc- 
essed by the IDCT circuit 1 24. e.g.. the DCT coefficients 
indicated by Xs in Figs. 3C and 3D. 

Accordingly, in one embodiment the IDCT circuit 124 
includes a MP@ML conpliant IDCT engine for calculat- 
ing IDCTs of the first three columns of a HDTV macrob- 
lock and all the IDCTs of a SDTV macroWock. Since, in 
accordance with the exenplary embodiment of the 
present invention, the DCT coefficients of all but the first 
three columns of a 8x8 HDTV macroblock are set to zero, 
there is no need to calculate the IDCTs for any coeffi- 
cients outside the first three columns. 

While the MP@f^L IDCT engine is used to calculate 
the column IDCTs, a IDCT engine for calculating the row 
IDCTs for a HDTV macroblock is also included in the 
IDCT circuit 124. However, in accordance with the 
present invention, this can be a relatively simple IDCT 
engine since 5 of the 8 row coefficients will always be 
zero by the preselected setting of such coefficients to 
zero- Furthermore, because only the first two or three 
columns contain coefficient values that are to be used in 
the calculation of row DCTS only 2 or 3 points need to 
t>e cateulated for each row transform. Note that in the 
embodiment which uses the 8x8 DCT arrangement illus- 
trated in Figs. 3C and 3D only 4 rows have to be trans- 
formed when no additional filtering is performed prior to 
downsanrpling by the downsampler 126. 



As an alternative method of inplementing an IDCT 
circuit 124 which uses an MP@ML IDCT engine as part 
of HD decoder IDCT circuit, the low frequency rectangle 
of coefficients from several macroblocks may be com- 
5 bined into a single macroblock as illi^trated in Fig. 3E. 
where "1 " is used to represent low-frequency coefficients 
from a first maaoWock. "2" is used to represent low-fre- 
quency coefficients from a second blocks. "3" is used to 
represent low-frequency coefficients from a third mac- 
,0 roblock and "4" is used to represent tow frequency DCT 
coefficients from a fourth macroblock. 

In accordance with this alternative emt)odiment. the 
IDCT circuit 1 24 aranges the DCT coefficients from four 
different HDTV maaoWocks into a single 8x8 btock of 
IS DCT coefficients in the described manner and a conven- 
tionat 8x8 point IDCT engine is then used to process the 
composite block of DCT coefficients. After processing by 
the 8x8 point IDCT engine relatively simple linear oper- 
atfons performed on the result of the IDCT operation are 
20 used to generate a good approximation of the IDCTs of 
the low frequency coefficients of each one of the four 
component HDTV macroblocks used to form the com- 
posite coefficient block. 

As discussed above, the video picture data output 
25 of the IDCT circuit 124 is downsanpled when the 
decoder 300 is operating in HDTV nrxxje prior to storing 
in the frame buffer 1 1 8 to thereby further reduce memory 
requirements. 

MP@ML allows for picti^es as large as 720x576 
30 pels for a total of 41 4720 pels p&r picture while US ATV 
allows for pictures as large as 1920x1080 pete per pic- 
ture. Sequences received in the 1920x1080 pel picture 
format can be downsampled by a factor of 3 horizontally 
and a factor of 2 vertically to yield a maximum resolution 
35 of 640x540 pels for a total of 345,600 pels. In such an 
entxxliment, the same amount of memory required to 
implement a full resolution SD decoder is satisfactory for 
storing a downsampled HDTV video picture. Thus, the 
memory provided for MP@ML would be adequate for the 
40 reduced resolution HD decoder as well, with an addi- 
tional 830 KBits availablefor other purposes, e-g.. for use 
as part of the coded data buffer 1 16 when operating in 
HDTV mode as descrtoed above. 

By selecting decoder conponents that are designed 
45 in the above manner, it is possible to innplement a com- 
bined low resolutfon HDTV decoder and full resolution 
SD decoder at a cost that is expected to be about 1 30% 
of the cost of a full resolution SDTV decoder The 
decoder 300 may be used as part of a television set 
50 which is designed to decode and display both HDTV and 
SDTV signals at a relatively low cost as compared to a 
system incorporating a full resolution HDTV decoder. 

The decoders illustrated in any of Figs. 1 and 2A-2B 
may also be used to implement picture-in-pfoture capa- 
55 bility in a HDTV or SDTV set without incumng the cost 
of having to provide a full resolutfon decod©- for the 
decoder used to display the picture shown inside the 
main picture of a picture-in-picture display. 
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For a MPEG compatible digital television set to oper- 
ate in a picture-in-picture nnode, two MPEG decoders 
must be used, a first decoder to decode the full resolution 
picture to be displayed on a first portion of the screen 
and a second decoder to decode the reduced size pic- 
ture displayed on a second smaller portion of the screen. 
Referring now to Fig. 4. there is illustrated a picture-in- 
picture ("PIP") decoder 400 in accordance with the 
preserrt invention. The PIP decoder 400 receives MPEG 
compliant demodulated digital video information, e.g.. 
either an SD or HD TV signal, from a tuner/demodiriator 
399 which is supplied to the input of a primary, e.g., full 
resolution decoder 401 and to the input of one or more 
reduced resolution decoders, i.e.. the first and second 
reduced resolution decoders 402, 403. 

The primary decoder 401 is responsible for decod* 
Ing the main picture of a picture-in-picture image while 
the first arxJ secorxj decoders are responsible for gener- 
ating separate images which will be displayed in a small 
area of the main picture. A separate reduced resolution 
decoder 402 or 403 is used for each additional image 
that is to be displayed in addition to the main picture. 

The output of the primary decoder 401 arxi the 
reduced resolution decoders 402, 403 is coupled to the 
input of a picture-in-picture video processing drcuit 
which operates to combine the main picture with the 
reduced resolution pictures output by the reduced reso- 
lution decoders 402. 403 prior to the resulting combined 
picture being displayed. 

In one embodiment of the preserrt invention the size 
of the reduced resolution pictures incorporated into the 
main pcture is selected to be 1/4 x 1/4 the size of the 
normal pHCture. In such an embodiment, each MPEG 8x8 
pixel block need only be decoded to a size corresponding 
to a block of 2x2 pixels. 

The cost of the IDCT circuit 1 24 used in the reduced 
resolution decoders 402. 403 can be substantially 
reduced in accordance with the present invention by per- 
forming the IDCT operations on only 2x2 blocks as 
opposed to 8x8 blocks. This is achieved by, e.g.. retaining 
arxj processing only the tpper left 2x2 block of OCT coef • 
f iderrts of each 8x8 OCT coefficient block of a HDTV pic- 
tu-e with all the other DCT coeff idents being set to zero. 
Accordingly, in such an enrtxxlimerrt. the IDCT circuit 
cost is reduced to approximately the cost of a drcuit 
which can perfbmri a 2x2 IDCT as opposed to an 8x8 
IDCT 

The lO drcuit 122 of the reduced resolution decod- 
ers 402, 403 can be simplified in a sinr^lar manner with 
the IQ circuit 122 operating only on a 2x2 block of DCT 
coeffidents. i.e.. 4 coeffident values, as opposed to 64 
DCT coeff icierrt values that form an 8x8 DCT coeffident 
block. 

In accordance with the present invention, the 
reduced resolution decoders 402. 403 store the low res- 
olutk>n frames in thetr reckKed size. Thus, by using 2x2 
DCT coeffident blocks the size of the frame buffos can 
be about 1/1 6th of the size that would be required if 8x8 



DCT coefficient t)locks. i.e., full resolution Ijlocks. were 
stored. 

In addition to reducing frame buffer size the pre- 
parser 1 1 2 serves to reduce channel buffer requirements 

5 and the complexity of the SP and VLD drcuit by, in 
accordance with the PIP embodiment, discarding ampli- 
tude/runlength pairs corresponding to each macroblock 
that exceed the 4 amplitude/runlength pairs that are used 
by the reduced resolution PIP decoder 402 or 403. 

10 In a PIP embodment the MCP drcuit 130 can per- 
form interpolation on the reference frames for maaob- 
locks whose motion vectors are not evenly divisit)le by 
four to achieve good prediction results. For a less expen- 
sive and somewhat lower quality implementation, motion 

15 vectors could be rouxJed to the nearest factor of 4 and 
applied wtttiout interpolation. 

While the above PIP decoder embodimerTt of the 
present invention has been descr%>ed in tfie context of 
using 1/4x1/4 size pictures for the secondary pictures 

20 inserted into the primary picture, other size pictures 
could also be generated at reduced cost e.g.. 1/2 x 1/2 
size pictures, using the decoder 1 00 descr bed above. In 
such a case, the downsamplir>g would have to be per- 
formed in a manner that woukl produce the desired 

25 reduction in size and data rate. 

It should also be noted that the arraf>gemerrt illus- 
trated in Rg. 4 can k^e used to provide picture-in-picture 
capability in a SDTV receiver as well as a HDTV receiver. 
Furthermore, the illustrated PIP decoder anangement is 

30 in no way limited to a spedf k: degree of resolutkxi with 
regard to the primary decoder 401 and only requires that 
the secondary decoder(s) 402. 403 be implemented as 
reduced resolution decoders as compared to the resolu- 
tion supported by the primary decoder. 

35 

Claims 

1. A decoder drcuit for decoding digital video data 
induing pictures at a resolution less than the full 

4<7 resotutk>n of the pictures, each pk:ture induding a 
plurality of macroblocks. each macrobkx:k induding 
a plurality of DCT coeffidents. the decoder drcuit 
comprising: 

preparser means for receiving the digital 

45 video data and for rerrioving from each macroblock 
DCT coefficients that exceed a preselected maxi- 
mum number of DCT coeffidents per nnacrobk>ck: 
and 

a channel buffer coupled to the preparser 
so means for terrqxvarily storing the vkleo data output 
by tfie preparser means. 

2. Thedecoder of claim 1. 

wherein each DCT coeffk:ient is represented 
55 by a number of t>fts and wherein each macrctkxk is 
represented by tfie total number of bits representing 
the DCT coeffidents induded in the macroblock. 
and 

wherein the preparser means operates to 
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decoder further comprises: 

a syntax parser arKi variable length decoder 
circuit coupled to the channel buffer for variable 
length decoding the second set of digital video data; 

a downsampler coupled to the syntax parser 5 
and variable length decoder circuit for downsam- 
pling the variable length decoded video data to 
thereby reduce the amount of resolution of the sec- 
ond picture and the amount of video data used to 
represent second picture. ^0 

25. A method of decoding variable length encoded dig- 
rtal video data including slice stuffing data, start code 
zero stuffing data, and macroblodis. the macrob- 
iocks representing pictures, to a resolution less than is 
the full resolution of the pictures, the method com- 
prising the steps of: 

receiving the digital video data; 

parsing the digital video data to identify bit 
slice stuffing and start code zero stuffing data; 20 

discarding the bit slice stuffing data and start 
code zero stuffing data; and 

storing the remaining video data in a channel 

buffer. 

25 

26. The method of claim 25. further comprising the steps 
of: 

variable length decoding the video data 
stored in the channel buffer to generated fixed length 
video data; ancJ 

downsampling the fixed length video data. 

27. A method of decoding digital video data including an 
interlaced picture, to a resolution less than the full 
resolution of the interlaced picture, the interlaced 35 
picture including a plurality of maaoblocks encoded 

in accordance with one of a field OCT basis or a 
frame DCT basis, the method comprising the steps 
of: 

downsampling the macroWocks encoded on 40 
a field DCT basis on a field basis: 

downsampling the macroblocks encoded on 
a frame DCT basis on a frame basis; 

storing the downsampied macroblocks in a 
frame buffer. 

28. The method of claim 27. wherein the video data fur- 
ther .comprises a non-interlaced picture, the method 
comprising the steps of: 

downsampling the non-interlaced picture on so 

a frame basis. 
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